Speech Synthesis with Attitude

Authors

  • Yoshinori Sagisaka
  • Takumi Yamashita
  • Yoko Kokenawa
Abstract

F0 characteristics were analyzed and modeled to generate speech output with natural prosody in communication systems. Lexicons were selected to express the speaker's attitude during the human speech generation process. We modeled the prosody using information from the constituent lexicons expressing attitude and markedness. Motivated by preliminary observations of prosodic variations in conversational speech, F0 characteristics were quantitatively analyzed using simple phrases consisting of adjectives expressing positive or negative attitude and adverbs expressing different degrees of markedness. Strong correlations were observed between the markedness of adverbs and F0 height: positive when an adjective phrase with a positive attitude follows the adverb, and negative when an adjective phrase with a negative attitude follows it. These consistencies have been perceptually confirmed by naturalness evaluation tests. Finally, F0 control is modeled using lexical information expressing positive or negative attitude and markedness.
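
The relationship described in the abstract can be illustrated with a minimal sketch. It assumes a simple linear rule in which the F0 height of the adverb rises with markedness before a positive adjective and falls with markedness before a negative one. The linear form, the constants BASE_F0_HZ and SLOPE_HZ, the markedness scale, and the names Phrase and adverb_f0_height are all hypothetical and chosen only for illustration; the abstract does not give the actual model or any parameter values.

```python
from dataclasses import dataclass

# Hypothetical constants for illustration only; not taken from the paper.
BASE_F0_HZ = 200.0   # assumed F0 height of an unmarked adverb
SLOPE_HZ = 15.0      # assumed F0 change per unit of markedness


@dataclass
class Phrase:
    adverb_markedness: float  # assumed scale: 0.0 = unmarked, larger = more marked
    adjective_polarity: int   # +1 = positive attitude, -1 = negative attitude


def adverb_f0_height(phrase: Phrase) -> float:
    """Return an illustrative F0 height (Hz) for the adverb.

    The sign of the adjective's attitude sets the direction of the
    markedness effect, mirroring the positive/negative correlations
    reported in the abstract. The linear form is an assumption.
    """
    if phrase.adjective_polarity not in (1, -1):
        raise ValueError("adjective_polarity must be +1 or -1")
    return BASE_F0_HZ + SLOPE_HZ * phrase.adjective_polarity * phrase.adverb_markedness


if __name__ == "__main__":
    for polarity in (1, -1):
        for markedness in (0.0, 1.0, 2.0):
            f0 = adverb_f0_height(Phrase(markedness, polarity))
            print(f"polarity={polarity:+d} markedness={markedness:.1f} -> {f0:.1f} Hz")
```

Under these assumed constants, a more marked adverb gets a higher F0 before a positive adjective and a lower F0 before a negative one. A model fitted to real data would replace both constants and possibly the linear form.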

Similar resources

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics in the multimedia domain concerns enabling computers to produce speech. Speech synthesis grants the computer the human ability of speech production. The data-based approach and the process-based approach are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...

Single Speaker Acoustic Analysis of Czech Speech for Purposes of Emotional Speech Synthesis

This paper deals with an acoustic analysis of sets of Czech sentences uttered by a single speaker. The data used in this analysis consist of both emotional and neutral sentences. We have been especially interested in features that are supposed to influence the perception of speech, such as F0, phoneme duration, formant frequencies, and energy. The analyzed sets of sentences were composed...

Review on Expressive Speech Synthesis

Expressive speech synthesis is one of the key technologies for achieving more advanced and natural human-computer interaction. Speech that can express various kinds of paralinguistic information, such as emotions, speaking styles, intentions, emphasis, and attitudes, is expressive speech. Expressive speech synthesis is concerned with synthesizing speech and adding various expressions relate...

Emotion and attitude conveyed in speech by means of prosody

In order to adapt a dialogue-system to its users and their needs, the information provided by the speech of the user constitutes useful input for a model of the user. Generating an adequate system’s response involves synthesizing speech in an appropriate expression style. This paper focuses on methodological issues concerning the study of speech variations conveying emotion and attitude. First,...

Modeling the prosody of Vietnamese attitudes for expressive speech synthesis

Attitudes or social affects are strongly implied in interaction processing, and specifically to socio-cultural aspects of language. This paper presents the modeling of attitude to apply in expressive speech synthesis in Vietnamese, an under-resourced tonal language. A prosodic model for Vietnamese attitude is proposed based on the concept of “rendez-vous” between linguistic levels and prosodic ...

Publication date: 2004